📚 Node [[mini batch_stochastic_gradient_descent_(sgd)]] ⟢ subnode @KGBicheno/mini batch_stochastic_gradient_descent_(sgd)

mini-batch stochastic gradient descent (SGD)

Go back to the [[AI Glossary]]

A gradient descent algorithm that uses mini-batches. In other words, mini-batch SGD estimates the gradient based on a small subset of the training data. Vanilla SGD uses a mini-batch of size 1.
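
As a rough illustration of the definition above, here is a minimal sketch in plain Python that fits a line y ≈ w*x + b by mini-batch SGD on mean squared error. The function name `minibatch_sgd`, the toy data, the learning rate, and the epoch count are all hypothetical choices made for this example, not part of the glossary entry. Each update averages the gradient over one mini-batch only; passing `batch_size=1` gives the vanilla SGD case.

```python
import random


def minibatch_sgd(xs, ys, batch_size, lr=0.1, epochs=200, seed=0):
    """Fit y ~ w*x + b with mini-batch SGD on mean squared error.

    Hypothetical helper for illustration; all defaults are assumptions.
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # visit the examples in a new random order each epoch
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Estimate the gradient from this mini-batch only,
            # not from the full training set.
            grad_w = 0.0
            grad_b = 0.0
            for i in batch:
                err = (w * xs[i] + b) - ys[i]
                grad_w += 2.0 * err * xs[i]
                grad_b += 2.0 * err
            w -= lr * grad_w / len(batch)
            b -= lr * grad_b / len(batch)
    return w, b


# Toy, noiseless data generated from y = 3x + 1 (an assumption for the demo).
xs = [i / 10 for i in range(20)]
ys = [3.0 * x + 1.0 for x in xs]

w_mini, b_mini = minibatch_sgd(xs, ys, batch_size=4)  # mini-batch SGD
w_one, b_one = minibatch_sgd(xs, ys, batch_size=1)    # vanilla SGD
```

On this noiseless toy problem both settings should recover values close to w = 3 and b = 1. On real, noisy data a larger mini-batch typically gives a less noisy gradient estimate at a higher cost per update.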
